AN EXPLICIT EXAMPLE OF OPTIMAL PORTFOLIO-CONSUMPTION 
CHOICES WITH HABIT FORMATION AND PARTIAL OBSERVATIONS 
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Abstract. We consider a model of optimal investment and consumption with both habit for- 
mation and partial observations in incomplete Ito processes markets. The individual investor de- 
velops addictive consumption habits gradually while only observing the market stock prices but 
not the instantaneous rate of return, which follows an Ornstein-Uhlenbeck process. Applying the 
Kalman-Bucy filtering theorem and Dynamic Programming arguments, we solve the associated HJB 
equation explicitly for this path dependent stochastic control problem in the case of power utility 
preferences. We provide the optimal investment and consumption policies in explicit feedback forms 
using rigorous verification arguments. 



1. Introduction 

Habit formation has become a popular alternative tool for modeling preferences on optimal con- 
sumption streams during recent years. It has been observed that the time separable property of 
traditional von Neumann-Morgenstern utility is not consistent with many empirical experiments, 
for instance, the celebrated Equity Premium Puzzle, (see Mehra and Prescott [TS] and Constan- 
tinides [i4] ) . The financial economics literature has been arguing that the past consumption pattern 
has a continuing impact on individual's current consumption decisions, and the preference should 
depend on both the consumption rate process and the corresponding consumption history. In par- 
ticular, the consumption path dependent linear habit formation preference 'E[J^ U{t,ct — Zt)dt\ 
has been widely accepted, where the index Zt stands for the accumulative consumption history. 
The definition that instantaneous utility function is decreasing in Zt indicates that an increase in 
consumption today increases current utility but depresses all future utilities through the induced 
increase in future standards of living. 

The continuous time utility maximization problem with linear addictive habit formation in the 
complete Ito processes market has been extensively studied in the past decades, see for instance, 
Detemple and Zapatero [S] and [B], Schroder and Skiadas [2D], Munk [IT] and Englezos and Karatzas 
[7]. Recently, this continuous time path dependent optimization problem has been solved in the 
general incomplete semimartingale market by Yu [23j, using the path-dependence reduction trans- 
form and convex duality analysis. 

The contributions of the present work are two-fold. 1). From the modeling perspective, we 
are considering the habits forming investor in incomplete Ito processes financial markets, together 
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with additional partial observations constraint. We are facing the case that the individual investor 
develops his own consumption habits during the whole investment period and meanwhile has only 
access to the public stock price information. In other words, he can not observe the mean rate 
of return process Ht and the corresponding Brownian motion Wt which appears in the stock price 
dynamics. In our model, we will assume nt follows the mean reverting Ornstein Uhlenbeck process 
driven by another Brownian motion Bt which is not perfectly correlated with Wt- Subject to the 
loss of information and filtration shrinkage, this stochastic control problem can not be covered 
by the main theorems in Yu [23], which motivates the separate work of this paper. 2). On the 
other hand, at the mathematical level, we solve the relatively complicated nonlinear HJB equation 
fully explicitly. As a consequence, we furthermore derive the J-"(^-adapted optimal investment and 
consumption polices in feedback form via rigorous verification arguments. Our analytical approach 
allows us to avoid proving the Dynamic Programming Principle and the measurable selection ar- 
guments associated with it. 

Optimal investment problems under incomplete information have been studied by numerous au- 
thors, and we only list a very small subset of them: Lakner |14J applies martingale methods and 
derived the structure of the optimal investment strategies. The linear diffusion model is studied by 
Brendle [2], who derives explicit results for the value of information on optimal investment with 
power and exponential utilities. The effects of learning on the composition of the optimal portfolios 
are studied in Brennan [3] and Xia [22] . Monoyios [16] considers the optimal investment with both 
the uncertainty of the drift parameter and the inside information of the the Brownian motion at 
terminal time, for which he obtains an explicit solution via Kalman-Bucy filtering together with 
techniques of enlargement of the filtration. Bjork, Davis and Landen [T] examine the market model 
with unobservable rates of returns that are arbitrary semimartingales, and they provide a unified 
treatment for a large class of partially observed investment problems. 

To the best of our knowledge, the utility maximization with consumption habit formation under 
incomplete information is not yet addressed in the literature. However, we want to single out the 
work of Munk [17] and Brendle [2] which are technically close to our problem. Munk [T7j tackles the 
utility maximization with consumption habit formation in the complete market, where he assumes 
the market price of risk process obeys a mean reverting SDE, moreover, the stock price process and 
the drift process are driven by the same Brownian motion. By applying the Market Isomorphism 
result by Schroder and Skiadas [20] to the work by Wachter [21], he obtains the explicit solution 
for the HJB equation and optimal control policies under power utility preference for the case p < 0. 
However, the complete market setting is too restrictive. On the other hand, Brendle [2] treats the 
problem of utility maximization on the terminal wealth in the incomplete Ito process market with 
partial observations. He figures out the solution of the HJB equation can be expressed in a closed 
form by solving some ODE systems with time dependent parameters. Moreover, by setting some 
technical substitutions, he proves that solving the previous ODEs is actually equivalent to solving 
a family of ODEs with constant parameters, whose solutions have been discussed earlier by Kim 
and Omberg [13j in a different problem setting. When the intertemporal consumption choice comes 
into play, together with path dependent habit formation impact, it is not clear whether the HJB 
equation can still admit an explicit solution. Our contribution can also be summarized as that 
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we combine the two models considered by the previous authors, and successfully solve the Munk^s 
habit formation problem in the setting of Brendle^s incomplete market with partial observations. 
Furthermore, we provide the rigorous verification of the main results which is missing in their pre- 
vious work. 

The structure of the present paper is outlined as: Section [2] introduces the market model and the 
concept of habit formation process. The utility maximization problem with addictive habit formation 
and partial observations is defined in Section [3j By applying the Kalman Bucy filtering theorem and 
Dynamic Programming arguments, we formally derive the Hamilton-Jacobi-Bellman(HJB) equa- 
tion for the power utility preference and we provide the decoupled form solution of this nonlinear 
PDE, which reduces the algorithm to solving some auxiliary ODEs with constant coefficients. Based 
on these classical solutions, the explicit feedback form of the optimal investment and consumption 
policies will be obtained. Section H] contains rigorous proofs of the corresponding verification argu- 
ments. At last, four cases of fully explicit solutions of some auxiliary ODEs are presented in the 
Appendix HI 

2. Market Model and Consumption Habit Formation 

On a probability space (r2,F,P) equipped with the background filtration F = {J^t)o<t<T, which 
satisfies the usual conditions, we consider a financial market with one risk-free bond and one stock 
account for a "small investor" over a finite time horizon [0, T]. The price of the bond solves: 

dS^ = nS'^dt, 0<t<T 

with initial price = 1, and without loss of generality, we assume the interest rate = 0, for all 
t G [0,T], this can be achieved by the standard change of numeraire. 
The stock price St is modeled as a diffusion process solving: 

(2.1) dSt = fitStdt + asStdWt, < t < T, 

with So = s > 0, where the drift process fit is F adapted, and satisfies the mean-reverting Ornstein 
Uhlenbeck SDE: 

(2.2) dfit = -A(Ait - fL)dt + (T^dBt, 0<t<T. 

Here, Wt and Bt are F adapted Brownian motions defined on (il,F,P) and they are correlated with 
the coefficient p £ [—1, 1]. We assume the initial value of the drift process fiQ is an J^q measurable 
Gaussian random variable, satisfying ~ N(7/o,^o)) which is independent of Brownian motions 
(Wt)o<t<T and {Bt)o<t<T- We also assume all the coefficients CT5 > 0, A > 0, /i, cr^ > are constants. 

Remark 2.1. Under the full background filtration F, we do not assume the existence of the equiva- 
lent local martingale measures in the market, for example, the exponential local martingale deflator 
defined by 

(2.3) Ht = exp f - /* ^dW^ -- f ^dv), 0<t<T. 

^ Jo (^S 2 Jo cr| / 

is allowed to be a strict local martingale. 
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Now, at each time t S [0, T], the investor chooses a consumption rate q > 0, and decides the 
amounts TTf of his wealth to invest in the stock, and the rest in bank. Then, in this self-financing 
market model, the investor's total wealth process Xt follows the dynamics: 

(2.4) dXt = {TTtfit - ct)dt + asTTtdWt, 0<t<T, 

with the initial wealth Xq = xq > 0. 

In this paper, we adopt the notation Zt = Z{t; ct) as "Habit Formation" process or "the standard 
of living" process to describe his "consumption habits level" . We assume the accumulative index 
Zt follows the dynamics: 

(2.5) dZt = {5{t)ct - a{t)Zt)dt, 0<t<T, 

where Zq = z > is called the initial habit. 
Equivalently, (|2.5p stipulates 

,.t 

(2.6) Zt = ze-^o"W'^«+ / <5(u)e-^X'')'=''^c„dn, 0<t<T, 

Jo 

and it is the exponentially weighted average of the initial habit and the past consumption. Here, 
the discounting factors a{t) and d{t) measure, respectively, the persistence of the past level and the 
intensity of consumption history, and are assumed to be nonnegative continuous functions.. 

In our current work, we are only considering the case of "addictive habits", i.e., we require 
investor's current consumption strategies shall never fall below the standard of living level, 

(2.7) Ct >Zt, VO < t < T, a.s.. 

We will see in the future that this additional consumption constraint implies the initial wealth must 
be sufficiently large to sustain habits and ensure the existence of optimal policies. 



3. Utility Maximization with Kalman-Bucy Filtering 

3.1. Dynamic Programming Arguments under Partial Observations Filtration F*^. From 
now on, we shall make the assumption that the investor can observe the stock price process St 
which is published and available to the public, however, the drift process nt and the information 
of Brownian motions {Wt)o<t<T and {Bt)o<t<T are unknown. We shall call this as the "partial 
observations information" scenario. This investment and consumption optimization problem with 
incomplete information will be modeled by requiring the investment strategy ('7rj)o<t<T and con- 
sumption policy {ct)o<t<T be only adapted to the partial observation filtration F"^ = {J-'t)o<t<T 
where J-t = (t{Su : < u < t}, which is strictly smaller than the background full information 
IF = (-^t)o<t<T- 

Applying the famous Kalman-Bucy stochastic filtering theorem, we can first define the Innova- 
tion Process in our market model as: 



(3.1) dWt = — (/it - fit)dt + asdWt 



1 /dSt . , , 
— i^-p'tdt, 0<t<T, 
crs^ St 
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which is a Brownian motion under the partial observations filtration , where the process fit 
E 



Ait 



-pS 



is the conditional estimation of drift process ^t- 
Moreover, by the same Kalman-Bucy filtering theorem, the process jit satisfies the linear SDE: 

Vtt + as<Tfj,p^ 



dfit 



X{fit - lj)dt + 



(3.2) 



with /io 



dWt. 



fitdt + Xfidt + 



0<t<T, 



E 







r/o- So we can solve for ft as the strong solution of the SDE (|3.2p by 



knowing the stock price process St and Clt 

E 



And the conditional variance Of 

1 



(3.3) d^t = 
with n{0) = E (no- r]f 

(3.4) Of = J7(t;0o 
where: 



{^^t - ft] 



satisfies the deterministic Riccati ODE: 



0-5 



2X)nt + {l- p')a 



dt, < t < T, 



■pS 



9q, which has an explicit solution as: 
fciexp(2(^)t) + fc2 



as- 



fciexp(2(^)t)-A:2 



f\ , 2 
(A + -f^)cT5, 



< t < T, 



A^<t|5 + 2asa^\p + ct^. 



= Vkas + (XcFs + (Jscr^p) + Oq, 
k2 = -Vkas + (Act| + asCF^p) + 6*0. 
By simple observation, we see Q,{t) converges monotonically to the value 
(3.5) e* = as^Jx''al + 2cJsa^\p + al - {Xal + 030 ^p) > 

as time t — )• +00, which we call as "steady state learning" (see also Brennan [3j). This convergence 
property of Cl{t) tells us the precision of the drift estimate goes from an initial condition to a steady 
state in the long time run, and after large time T, new return observations contribute to updating 
the estimated value of the state variable, but seldom reduce the variance of the estimation error. 
More precisely, by the evolution of Riccati ODE (j3.3p . we have the monotone solution 0.{t) on 
(0, 00) has the bounds 



(3.6) 



min(6lo, 6**) < n{t) < max(6'o, 9*), > 0. 



Under the observation filtration {J~^)o<t<T, we can instead rewrite stock price dynamics ()2.ip 
driven by the innovation process Wt as: 

(3.7) dSt = fitStdt + asStdWt, 0<t<T. 

Notice we are now seeking the optimal investment and consumption strategies ttj and ct which 
are only progressively measurable with respect to the partial observations filtration J^^ , where the 
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standard of living process Zt satisfies tlie ODE 

(3.8) dZt = {6{t)ct - a{t)Zt)dt, 0<t<T, 

and under the partial observations filtration , the wealth process dynamics (j2.4p under vrt and 
ct will be rewritten as: 

(3.9) dXt = {TTtfit - ct)dt + asTTtdWt, 0<t<T. 

Our goal now is to maximize the consumption with linear habit formation and terminal wealth 
by power utility preference under the partial observations filtration J^f: 



T 



Cs-Z,)P ^ {XtY 



-ds + 



(3.10) 7;(xo,2o,7?o,^'o) = sup E 

7T,ceA ^Jo P P 

where we take the risk aversion coefficient p < 1 and p ^ 0. 

Our aim is to provide an analytic solution of the control problem (|3.10|) using direct dynamic 
programming, i.e., first solve the Dynamic Programming equation analytically and then perform a 
rigorous verification argument. Therefore, there is no need to either define the value function at 
later times or to prove the Dynamic Programming Principle involving some complicated measurable 
selection arguments. 

Formally, at this level, we look for a smooth function v{t,x, z,ri,6) defined on an appropriate 
domain such that the process 

{cs - ZsY 



J 

Jo 



-ds + d{t,Xt,Zt,fit,nt), vtG[o,r], 



lo P 

is a local supermartingale for each admissible control (7rt,cj) G A and a local martingale for the 
optimal feedback control (7rJ',cJ') G A. The appropriate domain will be carefully defined later after 
we solve the associated HJB equation explicitly, moreover, some financial intuitions will also be 
clarified based on the domain of the solution. 

Furthermore, we recall that the conditional variance process Clt = ^{t, Oq) is actually a determin- 
istic function of time explicitly given by (|3.4p . We can therefore set the variable 8 in the definition 
of V by a deterministic function 6 = 9{t, Oq) depending on the parameter 6o to reduce the dimension 
of the function v, i.e., the variable 0(t;6Q) is absorbed by the variable t. Hence, we can define the 
function V{t,x, z,ri;6Q) as 

V{t, X, z, 7]; Oq) = v{t, X, z, r], 9{t, Oq)), 

and our target above can be simplified into finding a smooth enough function V{t,x, z,r];6Q) on 
some appropriate domain , denoted by V{t, x, z,rj) for simplicity, such that 



-ds + v{t,Xt,Zt,fit), yte[o,T] 



p 

is a local supermartingale for each admissible control (7rt,Cj) G A and a local martingale for the 
optimal feedback control (vr^, c^) £ A, for each fixed initial value r2(0) = ^o- 

An investment and consumption pair process (vrt,^) is said in the Admissible Control Space 
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A: if it is J-'j^-progressively measurable, and satisfies the integrability conditions: 

(3.11) / vr^dt < +00, a.s. and / ctdt < +00, a.s. 

Jo Jo 

with the addictive habits constraint that: q > Zt, Vt G [0, T]. Moreover, no bankruptcy is allowed, 

i.e., the investor's wealth remains nonnegative: Xt >0, < t < T. 

By the definition of V{t, x, z, rf) and Ito's formula, we can formally derive the Hamilton-Jacobi- 

Bellman (HJB) equation as: 

9.{t) + asCTfj^pj 



(3.12) 



Vt - a{t)zV, - A(r/ - fl)Vr, + 

\Pi r 1 



24 



-Vnn + max 



+ 



{c- z 



p 



+ max 



cV, + c5{t)V, 
0, 



with the terminal condition V(T,x, z,r]) = 



3.2. The Decoupled Reduced Form Solution. 

If V{t, X, z, rj) is smooth enough, the first order condition formally derives 



7r*{t,x,z,r]) -- 

(3.13) 4^^^ 



XT] 



c*{t,x,z,7j) = z+ (yx-6{t)V,^ 



1 

p-1 



which achieve the maximum over control policies vr and c respectively. 
Plugging forms of (j3.13p for vr* and c* , the HJB equation becomes: 



Vt - a{t)zV, - A(r/ - /2)K, + \ 2 2 T7 



XX 



{n{t) + asajjj}y_ _ ?7(r?(t) + a sCTf^p) VxVxr) 
(3.14) ^ ^ ^ ^ 

_ im+Y^prYk _ - ,(,)y.) -p^(vx- 5im) ^ = o. 

We expect that the smooth solution V{t, x, z, rj) of the HJB equation at time t = is actually 
the value function, i.e., y(0, xq, -zq, ^0) = '^{xq, ZQ.,rjQ.,9Q). Due to the homogeneity property of 
the power utility function and the linearity of dynamics (j3.9p and (j3.8h for Xt and Zt respectively, 
it's easy to see that if l/(t, x, z, r/) is finite, then it is homogeneous in {x,z) with degree p, i.e., for 
any rc > 0, z > and the positive constant /c, we have V{t, kx, kz, r/) = kPV{t, x, z, rj). It therefore 
makes sense for us to seek the value function of the form: 

(x — m{t, rj)z) 
V{t,x,z,ri) = —M{t,r]) 



P 

for some test functions m{t,r]) and M{t,rj) to be determined. By the virtue of V{T) = we will 
require MiT, rj) = 1 and m(T, r/) = 0. 

After we do the direct substitution in the above Equation (j3.14p and divide the equation on both 
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sides by (x — m{t,7])zy, the HJB equation becomes 

fit, m)z + \{ri - li)m^ - i^{^{t) + ascr^pfmj^r, + ^{^{t) + as<T^,p)m.n 



(3.15) 



p p 



X — m{t, r])z 
{VL{t) + asa^pf riinit) + agasp) 



■M 



2pai 



vv 



7] 



2(P-1K 



M 



2{p - l)o-|, M 



p 



(P - 1)4 
+ 5{t)m{t,ri) 



2 ^^7 



p 

p-i 



Mv'^ = 0. 



where we set 

/(t, m) = —mt + a{t)m — (1 + 5{t)m). 

Since the Equation (j3.15p above holds for all values of x and z, we can naturally set the unknown 
priori function m{t, r/) = m{t) as a deterministic function in time t and independent of the variable 
r] which satisfies: 

(3.16) fit, m) = -mtit) + a{t)mit) - (1 + (5(f)m(t)) = 
with the terminal condition ?7i(T) = 0, which is equivalent to: 

(3.17) m(t) = ^ ( / ^^^^-^ ~ aiv))dv^ ds. 0<t<T. 

Now we can substitute the function m(t) into the equation ()3.15p above, and simplify it as: 

2 



(3.18) 



Mt + 



+ 



prj" 



2(1 -P)4 



M + 



^4 ^ 



M^r, + (1 - p) ( 1 + S{t)mit) ) M 



(1 - p)4 



2(1 -p)4 



M 



Now in order to solve the above nonlinear PDE (|3.18p . we can set the power transform as 

Mit,v) = Nit,r,y-P 



(3.19) 

This idea of power transform was first introduced in Zariphopoulou 

And the nonlinear PDE (j3.18p for M(t, r]) reduces to the linear parabolic PDE for Nit, if) as: 



(3.20) 



Nt + 



+ 



pr] 



2(1 -P)M 



nit) + asCTf^p) . 

Nit, V) + —2 —^VV + ( 1 + ^(.tWit) ^ ' 



2at 



r][nit) +crsaf,p)p 

(i-p)4 



Nr^it,r,)=0 
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with N{T,r]) = 1. 

For the above hnear PDE (I3.20p of N{t,r]), we can further solve it expHcitly as: 



/_/ P 
(l + 5(s)VF(s)) ~ exp (^A{s, t)ri^ + B{s, t)rj + C{s, t)) ds 

+ exp (a{T, t)if + B{T, t)r] + C{T, tf 



(3.21) 



where we have for < t < s < T, A{s, t) = A{t; s), B{s, t) = B{t; s) and C{s, t) = C{t; s) satisfying 
the following ODEs: 



P 



(3.22) At{t) + ^. ^ ,^ „ + 2 - A + — ^(*) + 2 ^ (0 = 



2(1-P)V| L 4(1 -p) 



2 



4 



(3.23) B,{t) + - A + + 2XfiA{t) + ^iM±^W)!^(t)5(,) = O; 



4(1 -P) 



\ 2 

(3.24) a(t) + Xj2B{t) + ^ ^ i^B\t) + 2^(i) j = 0; 

with terminal conditions: A{s) = B(s) = C{s) = 0. 

We remark that the above ODEs are similar to the ODEs obtained by Brendle [2\ for terminal 
wealth optimization problem with partial observations, and he made an insightful observation that 
we can actually solve the above 3 ODEs with time t dependent coefficients by solving the following 
5 auxiliary ODEs with constant coefficients, see section 4 of Brendle [2\ for the detail proof. 

Theorem 3.1. For < t < s < T , consider the following auxiliary ODEs for a(t), h{t), c{t), f{t) 
and g{t): 

(3.25) at[t) = a a (t) + I 2A - ^}a{t) - — 

1-p ^ V (l-p)fTs/ 2(1 -p)a^g 

(3.26) hit) = -^^^^-P±P^alamt) - 2XMt) + (a - -^^^)b{t), 

l-p ^ V (l-p)o-5/ 

(3.27) ctit) = -ala{t) - ^-^-^^I^b\t) - Xpbit), 



(3.28) f,{t) = -2(1 - p')alf\t) + 2^^^±^f{t) + 

as 2a^ 



(3.29) g^(t) = al,{l-p'){fit)-ait)), 

with the terminal conditions a{s) = b{s) = c(s) = f{s) = g{s) = 0, and if we adopt the convention 
g = 0, then for the functions defined by: 

a{t) 



A{t;s) 



il-p)(l - 2a{t)n{t)^ ' 
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b{t) 



(1 - 2a{t)h{t) 



B{t;s) 



C{t- s) c{t) + ^^^^ ^b\t) - ^—^ log ( 1 - 2a{t)n{t] 



1 — p 



1 - 2a(i)fi(t) 



-|log(l-2/(t)A(t)) -pgit) 
we have the equivalence that: 

(3.30) A{t; s) = A{t; s), B{t; s) = B{t; s), C{t; s) = C{t; s), 0<t<s<T. 

Remark 3.1. The equivalence result (|3.30p reveals that solving the ODEs (|3.22p . (j3.23p . (j3.24p 
with variable coefficients is equivalent to solving the auxiliary ODEs (j3.25p . (|3.26p . (j3.27p . (|3.28p 
and (j3.29p with constant coefficients in such an order that we solve the Riccati ODE (j3.25p first, 
and substitute the solution a{t; s) into ODE (|3.26p and solve for the solution b{t; s), and etc. 

Actually, we can find fully explicit forms for a{t; s) , b{t;s), c{t;s), f{t;s) and g{t;s). We list 
all four different cases of fully explicit solutions in the Appendix depending on the risk aversion 
coefficient p and the market coefficients as, cr^, A and p. By simple substitutions, we can therefore 
solve the ODEs l!!3J2\i . UiSJ^ . (fSTM]) forA{t;s), B{t;s) and C{t;s) fully explicitly. 

Now, for t £ [0, T], rj E (— oo,+oo), we can define tlie effective domain for the pair {x,z) as: 

(3.31) {x,z) G = {{x',z') G (0, +oo) x [0,+oo); x' > m{t)z'}, 0<t<T, 
and the function 

V{t,x,z,ri)= (^l + 6{s)m{s)y''' exp(^A{s,t)7]'^ +B{s,t)r] + C{s,t)^ds 



(3.32) 



+ exp{A{T,t)r]'^ + B{T,t)rj + C{T,ty 



X — m{t)z 



p 



p 

is well defined on [0,T] x x M and it's the classical solution of the HJB equation (|3.12p . where 
m{t) = ex.p{J^ {6{v) — a{v))dv)ds, and A{s,t), B{s,t), C{s,t) are solutions of ODEs (j3.22p . 

Remark 3.2. In our main result below, we want to verify that the above classical solution V{t, x, z, rf) 
at timet = equals our primal value function defined in (j3.10p . i.e., V{0,xq, ZQ,r]Q;9Q) = v{xo, ZQ,riQ,9Q) 
However, the effective domain ofV{t,x,z,r]) motivates some constraints on the optimal wealth pro- 
cess and habit formation process . To wit, function V{t,x, z,r]) = — oo when x < m{t)z, 
which mandates that > m{t)Z^ for each t G [0, T] to ensure the process V{t, X^ , , fit) is well 
defined. In particular, when t = 0, we have to mandate the initial wealth-habit budget constraint 
that xq > m{0)zQ. 

3.3. The Main Result. 



Theorem 3.2 (The Verification Theorem). 

Build upon the initial wealth-habit budget constraint xq > m{0)zo, then either if risk aversion 
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constant 

p<0; 

or if 

< p < 1, together with market coefficients as, cr^, A, 0, p 

satisfy the additional assumption that explicit functions a{t;s), b{t;s), c{t;s), f{t;s) and g{t;s) 
defined in Theorem \3.1\ are bounded and 1 — a{t; s)^}{t) ^ onO<t<s<T (See Appendix \A\ for 
the detail discussion). Moreover, we assume 

(3.33) #±it < ''-'^ 



where G = maxj^O) ^*} o-f^d 0* is define in (j3.5p . And the upper bound Ki of A{t;s) on < t < 
s <T satisfies 



(3.34) 4Ki < ^ 



Then, the solution ()3.32p of HJB equation equals the value function defined in (|3.10p .' 

(3.35) y(0, xo, ^0, ^o; Oq) = v{xo, zo,rjo, 6o). 

And the optimal investment policy vr^ and optimal consumption policy c^ are given in the feedback 
form: 7r| = TT*{t, , , fit) and = c* {t, X* , , fit), 0<t<T, where the functions* {t,x, z,r]) : 
[0, T] X Dt X M — >R is defined by: 

(3.36) 7r*(t,x,z,7?)= \ 2 + - 2 -^j;^ {x-m{t)z), 0<t<T. 

L(l-P)f^5 N{t,ri)i 

c*{t,x,z,r]) : [0,r] X Dt X E — > E+ is defined by: 

(3.37) c*{t,x,z,r]) = z + - Mt)z) ^ o<t<T. 

{l + 5{t)m{t))^-pN{t,'n) 

And the optimal wealth process X* , for < t < T , is given explicitly by: 

(3.38) X; = (xo - m(O)zo)^^ exp ( /* , du + /* ^" dW^ + m(t)Z;, 

N{0,r]) \Jq 2(l-p)o-| 7o (l-p)o-s ^ 

where m{t) and N{t, rj) are defined in ()3.17p and (13.2ip respectively. 



Remark 3.3. The more complex structure of feedback forms of optimal investment and consump- 
tion policies is the consequence of the time non-separability of the instantaneous utility with habit 
formation. We can see the portfolio/wealth ratio w and consumption/wealth ratio -j^ are now 



depending on the habit- formation/wealth ratio ^: 



+ 



9.{t) +ascrfiP 



Nr,{t,fit) 



N{t,fLt) 



(1 



rn{t)^), 
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and 

c* _ 1 ^ / m{t) \ Z* 

^* {l + 6{t)m{t))^N{t,fit) ^ {l + 6{t)m{t))^N{t,fit)^ 
Moreover, although function c*(-,x, •,•) remains still linear and increasing in x > 0, c*{-,-,z,-) 
is not necessary increasing in z > 0, which shows the increase of initial habit dose not necessarily 
imply the increase of optimal consumption stream. And since the dependence of c*{t,x,z,r]) on the 
discounting factors a{t) and 6{t) are even more complicated, the optimal consumption process c^ is 
not necessarily monotone in the habit formation process . 



4. Proof of The Verification Theorem 

We will first show the consumption constraint q > Zt implies the constraint on the controlled 
wealth process by the following proposition: 

Proposition 4.1. The admissible space A is not empty if and only if the initial budget constraint 
with habit formation xq > m(0)zo is fulfilled. Moreover, for each pair of investment and consump- 
tion policy (vr, c) G A, the controlled wealth process X^''^ satisfies the constraint: 

(4.1) X^'" > m{t)Zt, 0<t<T, 

where the deterministic function m{t) is defined in ()3.16p and refers to the cost of subsistence 
consumption per unit of standard of living at time t. 

Proof. On one hand, let's assume xq > m(0)zQ, then we can always take vr^ = 0, and q = 
zq exp ^ Jo — a{v))dv^ for t £ [0, T], it is easy to verify X^''^ > and q = Zt so that (vr, c) E A, 
and hence A is not empty. 

On the other hand, starting from t = with the wealth xq and the standard of living zq, the 
addictive habits constraint ct > Zt, < t < T implies the consumption must always exceed the 
subsistence consumption ct = Z{t; ct) which satisfies 

(4.2) dct = {6{t) - a{t))ctdt, cq = zq, 0<t<T, 

Indeed, we first recall by the definition of Zt that dZt = {5tCt — atZt)dt with = z > 0, and the 
constraint that q > Zt implies 

(4.3) dZt > {5tZt - atZt)dt, Zq = zq. 
By the simple subtraction of (|4.3p and ()4.2p . one can get 

d{Zt - Ct) > {5t - at){Zt - ct)dt, Zq - cq = 0, 
from which we can derive that 

(4.4) e^o(^'-'^^)'i'{Zt - Ct) > 0, Vt G [0, T]. 
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And hence we can obtain q > q, which is equivalent to 

(4.5) ct> zoexp(^J^{6{v) -a{v))dvy 0<t<T. 

Define the exponential local martingale 
(4.6) 
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Ht = exp f - r ^dW^ - - /* ^dv ], 0<t<T. 
V Jo (Ts (^i 

Since fit follows the dynamics ()3.2p . which is 



At = e-*^7/ + /i(l - e~*^) + 







0-5 



dWu. 



similar to the proofs of Corollary 3.5.14 and Corollary 3.5.16 in Karatzas and Shreve |12] . Benes' 
condition implies H is a, true martingale with respect to (il, J-""^, P), 
Now define the probability measure P as 

dF 
dF 

Girsanov theorem states that 

rt 



Ht, 



Wt = Wt+ / —dv, 0<t<T 
Jo 0-5 

is a Brownian motion under (P, {J^f)o<t<T)- 

Then we can rewrite the wealth process dynamics as: 



Xt + / Cydv = X + n^asdWy, 
Jo Jo 

Since we have Xt > 0, it's easy to see that Tr^asdWy is a supermartingale under (r2,F'^,P), and 
take the expectation under P, we have: 

xq >¥. I c^dv 
Follow the inequality ()4.5p . we will further have: 



Xq > Zo^ 



exp 



{5{u) — a{u))du I dv 



Since 5{t) and a{t) are deterministic functions, we easily arrive xq > m{^)zo. 

In general, for Vt G [0,r], follow the same procedure, we can then take conditional expectation 
under filtration , and get 



Xt > ZtE 



exp 



{S{u) — a{u))du]dv 



again since 5{t),a{t) are deterministic, we obtain Xt > m{t)Zt, < t <T. 



□ 



Remark 4.1. The constraint on the controlled wealth process Xt and the habit formation process 
Zt agrees with the effective domain {{x,z) £ (0,oo) x [0,oo) : x > m{t)z} of the HJB equation for 



14 



XIANG YU 



the values of x and z. Aside from the consequence that the process V{t,Xt, Zt, fit) is therefore well 
defined, it plays a critical role in our following proof of the verification lemma. 

4.1. The Case p < 0. 

PROOF OF THEOREM^ 

First, for any pair of admissible control (iTtjCt) G A, Ito's lemma gives 



(4.7) 



d 



v{t,Xt,Zt,fLt) = g^^^'w{t,Xt,Zt,fit] 



dt + 



dW,. 



where we define the process X^, Zt,ftt) as 



Xt, Zt, fit) = Vt- a{t)ZtV, - Xifit - fi)Vr, + 



n{t) + asa^pj 



■Vr 



vv 



- CtVx + Ct5{t)Vz + ^'^^ + T^tll-t^x + \^<ysT^lyxx + yxrii^it) + OsOy^p^'Kf 

Recall V (t,x,z,r\) is the classical solution of HJB equation p.l2p . choose the localizing sequence 
r„, we integrate the equation (j4.7p on [0, t„ A T], and take the expectation, we have 



(4.8) 



y(0,xo,2o,r/o) > E 



(cg — Zg 

p 



-ds 



+ E 



V{Tn A T, Xr„AT, ^r„AT, At^At) 



Now, we follow the idea by Janecek and Sirbu [IT], let's fix this pair of control choice {iTt,Ct) £ 
A = AxQ, where we denote ^ the admissible space with initial endowment xq. And for Ve > 0, 
it is clear that Axq ^ Axo+e, and (vrt,ct) G Axq+e- Also it is clear that X, 



xo+e 
t 



Xt + e, <t <T. Follow the same procedure above, and notice process Zt keeps the same under 
the consumption policy q, then we can obtain: 



(4.9) 



V{0,xo + e,zo,r,o)>E 
+ E 



-ds 



P 



y{Tri A T, X^„AT + e, ^T„AT, ArnAT) 

By Monotone Convergence Theorem, we first know: 



(4.10) 



lim E 

n— >+oo 



r„AT 





= E 




p \ 







z,)p 



ds 



P 



For simplicity, let's denote Yt = [Xt — m{t)Zt ) , we know by definition (j3.32p that: 



1 



rl-p 



F(r„ A T, Xr^;,T + e, ^r„AT, A^^At) = -{Yr^AT + <^)''KaT- 

Proposition 14.11 gives Xt > m{t)Zt for < t < T under any admissible control pair (irtjCt), we 
know l^nAT + e > e > 0, \/0 < t < T. Since also p < 0, we will have 



(4.11) 



sup(y^„AT + e)^ <eP < +00. 



Now from Remark I^.H we already derived that A{t] s) < 0, VO < t < s < T. Combining this 
with the fact that m{s), 5{s) are continuous and hence bounded on [0,T] and when p < 0, we also 
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have 1 — a{t; s)Cl{t) > and 1 — f{t; s)Cl{t) > as well as a{t; s), b{t; s), c{t; s), f{t; s) and g{t; s) 
are all bounded for < t < s < T, we deduce that the explicit solutions B{t;s) and C{t;s) are 
both bounded onO<t<s<T, hence we have: 

^{0,1]) < ki exp{kr]) for some large constants k, ki > 1, 

which shows the existence of some constants k,ki > 1 such that 

1 / / \ \ i-p 



sup A'^^ < sup ( ki exp ( kfii 

" " ie[0,T]^ ^ 



< ki exp ( k sup fit ) • 

te[o,T] 



We recall that fit satisfies the Ornstein Uhlenbeck diffusion (|3.2p . which gives: 



fit = e-^^T] + fL{l - e-'^) + 



dWu. 



Hence, there exists positive constants I and > 1 large enough, such that: 

sup fit<l+ sup WWt, t G [0, T]. 
te[o,T] te[o,T] 

Using the distribution of running maximum of the Brownian motion, there exists some positive 
constants / > 1 and li such that 



(4.12) 



E 



supTV^^- 



i-p 

AT 



exp ( sup IWt 

te[o,T] 



< +00. 



At last, by the above (|4.1ip and (j4.12p . we can conclude that 



E 



sup V{Tn A r, Xr„AT + e, ^t„AT, At^At) 



< +00. 



By virtue of Dominated Convergence Theorem, we can deduce: 



hm E 

n— >-+oo 



V{Tn A T, Xr„AT + £, ^r„AT, f-T„AT] 



E 



(YT + €rN{T,ftT] 



E 


r(Xr + e)Pi 


> E 






L p J 




I p J 



Combine this with equation ()4.9p . and notice the pair of control (vr^, q) G A, we will see that: 

v{xo,zo,r]o,eo). 



y(0,xo + e, zo,?7o;6'o) > sup E 

Tr,ceA 



-ds + 
p p 



Notice V{t, X, z, rj; Oq) is continuous in variable x, and since e > is arbitrary, we can take the limit 



as: 



V{0, xo, zo,'no; Oo) = limy(0, xq + e, zo,r]o) > v{xo, zo,r]o, 0o). 

On the other hand, for vr^ and Cj defined by (|3.36p and (j3.37p respectively, we first want to show 
the SDE for wealth process: 



(4.13) 



dx; = (vr^i -cl)dt + asTTtdWt, 0<t<T, 



with initial condition xq > m{0)zQ has a unique strong solution and also satisfies X^ > m{t)Zf , Vt G 

[o,r]. 

Denote Y^* = X^ — m{t)Z^, and apply Ito's lemma and substitute vrj' as defined by (|3.36p . we 
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can get: 



(4.14) 



nlfLt - 4 - mt{t)Zl - m{t)5{t)ct + m{t)a{t)Zl 
- mt{t) + m{t)a{t)) - (1 + m{t)5{t))4 + 



dt + -KlasdWt 



at 



N 



dt + 



L(l -p)as 



+ 



0-5 



N 



Y,*dWt. 



Recall the definition of m{t) by (I3.16P and substitute defined by (I3.37|) into (14.141) above, we 
will further have 



dv; 



1 + 5{t)m{t) 



N 



+ 



+ 



L(l -p)as 



+ 



+ 



n{t) + as(T^,p) M 



N 



Y;dt 



as 



N 



Y^dWt. 



In order to solve in a more explicit formula, we define the auxiliary process by: 



Nit,fit) 
Y* 



VO < t < T. 



By Ito's lemma, we can derive the SDE for process as: 



(4.15) 



dTt 



N 



Nt - \{jlt - Jl)Nr, + 



24 



Pt[^{t) + (^SCTfj^PjP 



1 + 5{t)m{t) 



+ 



Pf^t 



-N 



dt + Vt 



(1 -p)as 



[l-p)al 



dWt. 



-Nr, 



{l-pfal 

Recall that N(t,rj) satisfies the linear PDE (j3.21|) . we can simplify (j4.15|) to be: 



dT 



t — 



2(l-p)V| 



dt + Tt 



(1 -p)as 



dWt. 



Hence, we can finally get the above SDE has a unique strong solution as: 



Toexp 



Pi 



2(1 -P)4 



du 



(1 -p)crs 



dWu], 



Initial condition T 







x(!^rn'{o)zo ^ ^ implies > 0, V < t < T. And, hence, we finally proved 



that the SDE (14.131) has a unique strong solution defined by (|3.38p and the solution satisfies 
the wealth process constraint (|4.ip 

Now, we proceed to verify vr^ and c* are actually in the admissible space A. 

First, by the definition (j3.36p and (j3.37p . it's clear that vr^ and are J^^ progressively measurable, 
and by the path continuity of = X^ — m{t)Z^ , hence, of vr^ and Cj , it's easy to show that: 



I {-nD'^dt < +00, and / cldt < +00, a.s. 
Jo Jo 
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Also, since X* > m{t)Z^, \/t G [0, T], by the definition of c^, we know the consumption constraint 
c* > , \/t £ [0, T] is satisfied. And hence (vrJ'jCi) G A. 

Given the pair of control policy (vr*, c^) as above, following the same steps and the definition of 
stopping time t„, instead of (j4.8p . we can now instead get the equality: 



V{0,xo,zo,rio;eo) 



E 



T„AT 



dt 



P 



+ E 



V{Tn A T, X*^^rp, Z*^^rp, fir„ 



AT) 



And hence, we apply Monotone Convergence Theorem again: 



hm E 

n— >-+oo 



T„AT 





= E 




p J 







{c*t-Zt)P 
P 



dt 



When p < 0, we have function V{t, x, z^rf) < by it's definition, and by Fatou's lemma. 



y{'T'nf\T,X*T,Z*T,fl 



lim supE 

n— >+oo 

Therefore, it gives 

y(0,xo,^;o,??o;6'o) < E 
which completes the proof. 



r„ATJ 



< E 



V{T,X*T,Z*T,fiT] 



E 



P 



T 



(.4 



P 



P 



< v{xo,zo,r]o, 



□ 



4.2. The Case: < p < 1 . We proceed to prove the following two Lemmas which play important 
roles in the proof of the second part of our main result. 



Lemma 4.1. // constant k > satisfies: 
(4.16) k < 



2(9 + asa^p)^ 

for any t >0, there exists a constant Ai such that 

ft 



E 



exp 



kjllds 



< Ai < +00. 



Proof. Similar to the proof of Lemma 12 of Fleming and Pang [8], it is easy to choose an increasing 



sequence of smooth functions Qn{y) as n — )• oo such that < Qn{y) < n with Q'niv) 



and 



uniformly bounded. And for each fixed t > and r], we define: 



(f){t,r])=E exp ( I Qn{fls)ds 



where (xq = rj. 

Similar to the proof of Feynman-Kac formula, the function (j){t,r]) is a classical solution of the 
linear parabolic equation: 



(4.17) 



2a| 
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with initial condition (f)(0,rj) = 1. See also Lemma 1.12 in Pang [18] for details. 

First, it's clear that constant is a subsolution of the above equation. Moreover, under assump- 
tion ()4.16p . it's easy to show that for each fixed t > 0, the equation: 

2 



has two positive real roots 



4 



-x^ -2Xx + k = 



V , V 
xi = , and xo = ■ 

z H Z 

And for any positive constant a such that: 



° ^ " ^ 2(ei+<xs<TMP)2 ' 
with ©1 = max(0,^*) and 02 = min(0,0*), and the positive constant b such that: 

(Bl + CJ5(T^/9)2 A2/i2a2 



b > a- 



4 2a2(®i+^f^-2aA + /t' 

It's easy to verify that f{t,ri) = exp{bt + ar]"^) satisfies: 



2 



2 



ft > ^-^ fm - ^iv - fJ')fv + 

lag 

with the initial condition f{0,ri) > 1. 

And since Qniv) < ^^^! g^t function f{t,r]) is the supersolution of the equation (j4.17p . and 
(^0, f{t,7])^ is the coupled subsolution and supersolution. Theorem 7.2 from Pao [TU] shows that 
function (j){t,rj) satisfies: < (/>(t, < fit,i]) = Ai, and hence Monotone Convergence Theorem 
leads to: 



E 



exp ( / kfi^gds 



< Ai < +CX). 

□ 

Lemma 4.2. // constant k > satisfies 

T Act?. 

for fixed constant k> 0, there exists a constant A2 independent oft, and 

E exp (^k{fit + < A2 < 00, t G [0,r]. 

Proof. Similar to the proof of Lemma 14. 1^ we again construct an increasing sequence of functions 
{Qn{y)} for n E N such that lim„_j._(_oo Qniu) = k{y + k)"^. And for each fixed t £ [0, T] and r], we 
define: 

= E exp (Qnifit) 
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where ftQ = rj. 

Then a direct corollary of Theorem 5.6.1 of Friedman [9] gives the function 'ip{t,rj) is a classical 
solution of the linear parabolic equation: 

(4.19) 11^1 = - - A(r/ - /i)V'r„ 

with initial condition ^(0, ?]) = e*^"*^''^. 

Under assumption (|4.18p . and choose any constant a such that 

Aa2 



k < a < 



's 



where xi = - , , ^"^^ -tt is one real root of the algebraic equation: 



-X 



2 - 2Ax = 



for each fixed t G [0, T]. And choose any positive constant b such that 

4 4 



2a2 (e+<xyMP)^ Z^X 



It is easy to verify that /(t, 77) = exp(6t + a(r/ + k)^) satisfies 



2 



ft > ^ fvv - - P)fr„ 

with the initial condition f{0,r]) = e"^''~*'''^^ > '0(0,??), hence we get the function f{t,7f) is the 
supersolution of the equation (|4.19p . and it is trivial to show g{t,rj) = is the subsolution, there- 
fore (0, /(t,?7)) are the coupled subsolution and supersolution. Again by Theorem 7.2 from Pao 
[19] . that function V(i,^) satisfies: < i'{t,ri) < f{t,7]) < e''^+"'(''+'")' = A2, hence Monotone 
Convergence Theorem implies: 



E 



exp (^k{pt + < A2 < +00, yt G [o,r]. 

□ 



PROOF OF THEOREM^ CONTINUED. 

For any pair of admissible control (7rt,ct) E A, similar to the case for p < 0, choose the same 
localizing sequence such that 



(4.20) 



V{0,xo,zo,'no) > IE 



-ds 



P 



+ E 



V{Tn A T, Xr„AT, ^t„AT, At„At) 
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Now, by monotone convergence theorem, we first know: 



lim E 



r„AT 



as 






= E 




p J 







-ds 



P 



And for < p < 1, V{t,x,z,7]) > for all t G [0,T] by the definition (13:32]) and (jUD, and 
Fatou's lemma yields that: 



lim E 



V{Tn A r, Xr^/^T, ^r„AT, Ar^AT; 



> E 



V{T,XT,ZT,fLT) 







= E 






I p J 



which implies that: 



y(0, j;o,2;o,??o) > sup E 



p 



p 



v{xo,zo,rio,So)- 



On the other hand, for the vrj" and defined by (|3.36p . (|3.37p . again follow the same procedure 
in the proof for case p < 0, we can show vrj' and are actually in the admissible space A. 
Now, by policies vr^ and c*, similarly, we can now get the equality: 



E 



-ds 



p 



+ E 



y(r„ A T,X* .r^, Z*ATi At„at) 



By the definition V{t, x, z,rj), we know that: 



V{T A Tn,X*T^,^,Z^^,^,fiTArJ < h [f^ — ) ^^^^ +N'{TA T„, ^ta 

for some positive constants ki, which are independent of n. 



For the first term, we notice that 



Y*\2P 1 



exp 



exp 



TAt„ 



(1 -p)as 

TAr„2(p2+p)^2 



TAt„ 



and hence, we have 

'Y*\2p 

E 



sup , , 

„ V iv y tat 



<-E 
-2 



sup exp 

n 



(i-p)M 



+ sup exp 



(1 -p)as 

TAr„2{p^+p)f,l 



dWu 
du 



2, -.2 



'ip'^jl: 



[l-pfal 



■du 



and again since fit follows the dynamics (j3.2p . by Benes' condition (see Corollary 3.5.14 and 
Corollary 3.5.16 in Karatzas and Shreve [l2]), we see that the exponential local martingale Mj = 



du] is a true martingale, and hence, by Doob's maximal inequality. 
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we first derive that 



E 
<E 



sup exp 



sup exp 

te[o,T] 



TAt„ 



(1 -p)as 



dWu 



TAt„ 



exp 



(1 - p)(Ts 



dWu 



{l-pYal 

(i-p)M 



du 



du 



^Pfj-u 



-dWu 



Ap^fil 



du 



where k{p) is a constant depending on p. Moreover, similar to the proofs of Corollary 3.5.14 
and Corollary 3.5.16 in Kamtzas and Shreve [12], Corollary 1 and Corollary 2 in Grigelionis and 
Mackevicius |10) further states that the true martingale Mt defined as above satisfies the finite 
moments property, i.e., for any r > 1, we have E 



< oo. Hence we can conclude that for r = 2, 



E 



exp 



T 



4p/i« 



-dWu 



Ap'^fil 



(l-p)V| 



du 



< oo. 



lo {l-p)crs 

For the second part, we can therefore apply Assumption (j3.33p and Lemma 14.11 and it yields 
that: 



E 



sup exp 



TAt„ 



i2p^ + 2p)fLl 

(i-p)M 



du 



< E 



exp 



(2p2 + 2p)A^ 
(l-p)V| 



du 



< Ai < +00, 

for some constant Ai > 0. 

We now recall that under the assumption that a(t; s), b{t; s), c{t; s), f{t; s) and g{t; s) defined in 
Theorem 13.11 are bounded and 1 — a{t; s)0.{t) 7^ on < t < s < T, functions A{t; s), B(t; s) and 
C{t; s) are bounded onO<t<s<T and therefore there exists constants k, ki such that 

N{t,r]) < /fce-^i('?+'=i)', 

where A{t; s) < Ki for all < t < s < T, and hence we have 

sup(iV2(rAr„,/iTArj) < sup fce2^i('^*+'=i)'. 

n ^ ^ t&[0,T] 



Then we just need to show that 
(4.21) E 



sup ke^K^(f^'+k,f 
te[o,T] 



< CXD. 



Define f{x) = ^"^^^(^+^1)^ and apply Ito's lemma, we have 



(71 



+ 2Ki 



^{t) + (7s(7f,p 



dt + dLt. 
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Assumption (j3.34p guarantees —AKiX + 8Kf 
bound constant k2 > such that 



< 0, and hence there exists an upper 



where the local martingale part is: 



dLt = ip{fit)4:Kifit 
From which we can derive that 
E 



<7S 



sup (p{fit) < ip{r]) + / A;2E sup ip{fi, 
te[o,T] J Jo ^sG[o,t] 



dWt. 



dt + E 



sup Lt 

46[0,T] 



Burhholder-Davis-Gundy Inequality and Jensen's Inequality induce that 











E 


sup Lt 


<.,,(£e 
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Helo,T] J 







,^2g4/?i(At+fei)2 



dt 



_ Act 

However, under Assumption ()3.34p . there exists a constant e > such that AKi + e < ^q_^^^^ 
and by Holder's Inequality, choose the conjugates q = ^^^'^ and | + ^ = 1, then 



E 



^2g4Ki(At+fci)2 



< E 



2q' 



E 



,(4Ki+e)(At+fci)2 



and by Lemma 14.21 there exists a constant A2 independent of t such that 



E 



,{4Ki+£){/lt+fcl)2 



< A2 < 00, vt G [o,r]. 



And again by the fact that there exists positive constants / and li > 1 large enough, such that: 



sup ilt<l+ sup hWt, t G [0,T], 
te[o,T] ie[o,T] 



we also obtain 



E 



2q' 



dt<TE 



{I + sup hWtf'^' 

te[o,T] 



< 00, 



due to the distribution of running maximum of the Brownian motion Wt- Hence we get the 



boundedness of E 



sup Lt 

te[o,T] 



< A;4 < 00 for some constant k4, and 



E 


sup ip{flt) 




sup ififls) 


dt + ki 




4e[0,T] J 


Jo 







The Gronwall's Inequality verifies (|4.2ip . 

Therefore, putting all pieces together, we eventually derived that 



(4.22) 



E 



sup V{Tn A T, Xr„AT, Zr^AT, At^At) 



< 00 
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and Dominated Convergence Theorem leads to: 



lim E 



V{Tn A T, Xr„AT, ^t„AT, fl-TnAT) 



E 



p 



Together with Monotone Convergence Theorem, we deduce 



V{0,xo,zo,r]o;eo) 
which completes the proof. 



E 



p 



< v{xo,zo,m,do), 
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Appendix A. Fully Explicit Solutions to the Auxiliary ODEs 



Follow the arguments by Kim and Omberg |13] . we can even solve the auxiliary ODEs (j3.25p . 
()3.26p . ()3.27p . ()3.28p and ()3.29p fully explicitly depending on the risk aversion constant p and all 
the market coefficients as, (y^-, A, p: 



A.l. The Normal Solution. The condition for the Normal solution is 

A ^2 2Appcr^ 



(A.l) 
and then we define: 



P<^1 



72 = -A + 



{1-p + pp^) 2 

1 — p ^ 
P 



72 - 7173, 
(1 -p)as' 



73 



(1 - p)al ' 



(1 - p2)a2 + {Xas + pa^y 



0-5 



We can solve the equations K2Eh . K27h . (lOHD and ([M]) as: 

a{t;s)- ' 



h{t-s) 



c{t;s) 



2(1 - p)4 [2e - (e + 72)(1 - e2C(*--)) 
pXfl{l - e«(*-^))2 



2(l-p)4e 2e-(e + 72)(l-e25(*-^)) 



p 



2 ,-.2 



2(1 -pvu e e+72 



2(1 - p)ai.{e - li) 



log 



{s-t) + 



2e-(^ + 72)(l-e2«*-)) 



pX^p^ 


(^ + 272) 


em-s) _ 472e«(*~^) + 272 - 




2(1 


-pVse 


2C-(C + 72)(l-e2«(*--))" 





2^ 
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1 _ e26(t-s) 



2a(i-s) 



9{ ,s) og ^^ 2cjs^ie5i(*-^) 

(1 - - p') / (a5g + Aa5 + ^ ) + (a^e - Aa5 - ^)e2«*-) 



2(1 -p + pp2) 
p2A(s - t) 



2(1 -p + pp^) 2{1 -p + pp^)as' 
The condition for the bounded Normal solution is 
(A.2) 73 > 0, or 71 > 0, or 72 < 0. 

The condition for the explosive solution and the critical point is 

73 < 0, 71 < 0, and 72 > 0, 

Remark A.l. By the observation, if p < 0, the conditions l\A.l\i and l\A.2\i hold, and we have 
CL{t;s) < is a bounded solution as well as 1 — 2a{t;s)Cl{t) > 1 > and 1 — /(t; s)fl(t) > 1 > 0, 
hence we can finally conclude the solutions of ODEs (j3.22p . (|3.23p . (j3.24p are all bounded on 



<t < s <T. We also notice that A{t) 



a{t) 



< 0, onO<t<s<T. 



{l-p){l-2a(t)n{t)) 

A.2. The Hyperbolic Solution. The condition for the Hyperbolic solution is 

A x2 2Appcr^ 



(l-p) 



0, 



together with 



72 



-A + 



(1 -p)as 

Then we can solve l^2B, (1226]), 1^^:27} . (f3:28]l and (Km as: 

-1 



a{t; s) 
b{t;s) 



72_ 

^27i(s-i-:^) 271 

2Xp 

47172(5 -i - j^) 



72A/x(g -t + ^J 
271 



72a2(.-t) X'-p'^Us-t-±){s-tr a^log l(.-t)72-l 
c{t; s) = ^ + ^-^ ^^^^ + 



271 



fit;s) 



2471 (s -t- 

1 _ p26(i-s) 



72 ' 



71 



2o-s (0-5^1 + Xas + pcjfj,) + (0-5^1 - Xas - p(Ty)e'^^^^^ ^) ' 



{Xas + P(^^i) 
2as 



(s-t) 



+ 



^1(1 - P') 



271 



log 



1 + 72(^-5) -72{s-t) 
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The condition for the bounded Hyperbohc solution is 

72 < 0. 

The condition for the explosive solution and the critical point is 



1 

72 > 0, and s — t = — . 

72 

A. 3. The Polynomial solution. The condition for the Polynomial solution is 

(l-p)4 

together with 

72 = -A + 7- ^ = 0, 

{l-p)as 

Then we can solve ([325]), ([326]), ^^?TJ\ . (13:28]) and (f3:29]l as: 

2(1 

2(1 -Pjf^s 



/(t;s) 
5(i;s) 



1 1 _ e2Ci(i-s) 

2crs (0-56 + Aas + /9(T^) + ((75^1 - Ac75 - pcr^)e2?i(*-^) ' 
_lj^ / (tTg6 + Ao-s + po-^) + (0-56 - Acjg - po-^)e^g^(*~'^) \ (Ao-5 + pa^) 



4(1 ^- 

All Polynomial solutions are bounded. 



A. 4. The Tangent solution. The condition for the Tangent solution is 



A^A^ 



P^l 



(l-p)4 (1-pVP'' 



Now, we define 



C = \/— A, = tan 



Then we can solve (IX^ . (IX^ . (IXTTH . (I22SI) and ([321 as: 

«(*; tan (((s - t) + - 

271 V / 271 

h{t; s) =— — 1 — tan(OT) tan(C(s — t) + w) + sec{w) sec(C(s — t) + ro) 
71 L 
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c{t; s) = 



271C 

A^A^(272 + C^ 
27iC^ 



+ 



sec {-uj) — sec {({s — t) + w) 



tan(^(s — t) + w) — tan(ci7) 



fit;s)=- 



A^A^(7l + C^)-72CVg , , ( < . 

1^^2 + ^ sec(ro) cos(C(s - + ro)J , 

_\ 1 _ e2gi(t-.) 

20-5 {(7si\ + Acts + pcr^) + (cr^^i - \as - /9cr^)e2€i(*-«) ' 



1 

2^ 



■log 



cos(("(t — s) + w) 



2as 



COs(ci7) 



72 

271 



t) 



All Tangent solutions are explosive solutions and the critical point is 
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